Mining spatiotemporal video patterns towards robust action retrieval
Authors
Abstract
In this paper, we present a spatiotemporal co-location video pattern mining approach with application to robust action retrieval in YouTube videos. First, we introduce an attention shift scheme to detect and partition the focused human actions from YouTube videos, based on visual saliency [13] modeling combined with face [35] and body [32] detectors. From the segmented spatiotemporal human action regions, we extract interest points with the 3D-SIFT [17] detector. Then, we quantize all detected interest points from the reference YouTube videos into a vocabulary, based on which we assign each individual interest point a word identity. An Apriori-based frequent itemset mining scheme is then deployed over the spatiotemporally co-located words to discover co-location video patterns. Finally, we fuse visual words and patterns and leverage boosting-based feature selection to output the final action descriptors, incorporating the ranking distortion of the conjunctive queries into the boosting objective. We carried out quantitative evaluations on the KTH human motion benchmark [26] as well as on 60 hours of YouTube videos, with comparisons to the state of the art. Crown Copyright © 2012 Published by Elsevier B.V. All rights reserved.
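The mining step named in the abstract can be illustrated with a minimal Apriori sketch. This is not the authors' implementation: the abstract does not say how co-located words are grouped or which support threshold is used, so the transaction layout (one set of visual word IDs per spatiotemporal neighbourhood), the function name `apriori`, and the `min_support` value below are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: count} for itemsets found in >= min_support transactions.

    Each transaction is assumed (hypothetically) to hold the visual word IDs
    that co-occur in one spatiotemporal neighbourhood of an interest point.
    """
    transactions = [frozenset(t) for t in transactions]

    # Level 1: keep single words that meet the support threshold.
    counts = Counter(word for t in transactions for word in t)
    current = {frozenset([w]): c for w, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: merge frequent (k-1)-itemsets into size-k candidates.
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset must itself be frequent.
            if all(frozenset(s) in current for s in combinations(union, k - 1)):
                candidates.add(union)

        # Count support of the surviving candidates.
        current = {}
        for cand in candidates:
            support = sum(1 for t in transactions if cand <= t)
            if support >= min_support:
                current[cand] = support
        frequent.update(current)
        k += 1

    return frequent


# Toy data: five neighbourhoods, each a set of (made-up) visual word IDs.
neighbourhoods = [{3, 7, 12}, {3, 7}, {3, 7, 12, 20}, {7, 20}, {3, 12}]
patterns = apriori(neighbourhoods, min_support=3)
for itemset, count in sorted(patterns.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), count)
```

On this toy input the word pairs {3, 7} and {3, 12} survive as co-location patterns, while {7, 12} falls below the threshold and therefore prunes the triple {3, 7, 12} before it is ever counted. That pruning is what keeps Apriori tractable on large vocabularies.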
Similar resources
Retrieval Method for Video Content in Different Format Based on Spatiotemporal Features
In this paper, a robust video content retrieval method based on spatiotemporal features is proposed. To date, most video retrieval methods rely on the characteristics of video key frames. Such frame-based methods are not robust enough across different video formats. In our method, the temporal variation of visual information is represented using spatiotemporal slices. Then the DCT is used to extra...
Social Video Retrieval: Research Methods in Controlling, Sharing, and Editing of Web Video
Content-based video retrieval has been a very efficient technique with new video content, but it has not regarded the increasingly dynamic interactions between users and content. We present a comprehensive survey on user-based techniques and instrumentation for social video retrieval researchers. Community-based approaches suggest there is much to learn about an unstructured video just by analy...
Mining spatiotemporal patterns in dynamic plane graphs
Dynamic graph mining is the task of searching for subgraph patterns that capture the evolution of a dynamic graph. In this paper, we are interested in mining dynamic graphs in videos. A video can be regarded as a dynamic graph, whose evolution over time is represented by a series of plane graphs, one graph for each video frame. As such, subgraph patterns in this series may correspond to objects...
Semantic retrieval of events from indoor surveillance video databases
With the existence of “semantic gap” between the machine-readable low level features (e.g. visual features in terms of colors and textures) and high level human concepts, it is inherently hard for the machine to automatically identify and retrieve events from videos according to their semantics by merely reading pixels and frames. This paper proposes a human-centered framework for mining and re...
Temporal Patterns in Bot Activities
Correlated or synchronized bots commonly exist in social media sites such as Twitter. Bots work towards gaining human followers, participating in campaigns, and engaging in unethical activities such as spamming and false click generation. In this paper, we perform temporal pattern mining on bot activities in Twitter. We discover motifs (repeating behavior), discords (anomalous behavior), joins,...
Journal title:
- Neurocomputing
Volume 105, Issue
Pages -
Publication date 2013